|
Book details / order |
MINING THE SOCIAL WEB ANALYZING DATA FROM FACEBOOK, TWITTER, LINKEDIN, AND OTHER SOCIAL MEDIA SITES |
Facebook, twitter, and linkedin generate a tremendous amount of valuable social data, but how can you find out who's making connections with social media, what they’re talking about, or where they’re located? this concise and practical book shows you how to answer these questions and more. you'll learn how to combine social web data, analysis techniques, and visualization to help you find what you've been looking for in the social haystack, as well as useful information you didn't know existed.
each standalone chapter introduces techniques for mining data in different areas of the social web, including blogs and email. all you need to get started is a programming background and a willingness to learn basic python tools.
get a straightforward synopsis of the social web landscape
use adaptable scripts on github to harvest data from social network apis such as twitter, facebook, and linkedin
learn how to employ easy-to-use python tools to slice and dice the data you collect
explore social connections in microformats with the xhtml friends network
apply advanced mining techniques such as tf-idf, cosine similarity, collocation analysis, document summarization, and clique detection
build interactive visualizations with web technologies based upon html5 and javascript toolkits
"data from the social web is different: networks and text, not tables and numbers, are the rule, and familiar query languages are replaced with rapidly evolving web service apis. let matthew russell serve as your guide to working with social data sets old (email, blogs) and new (twitter, linkedin, facebook). mining the social web is a natural successor to programming collective intelligence: a practical, hands-on approach to hacking on data from the social web with python."
--jeff hammerbacher
"a rich, compact, useful, practical introduction to a galaxy of tools, techniques, and theories for exploring structured and unstructured data."
--alex martelli, senior staff engineer, google
about the author
matthew russell, vice president of engineering at digital reasoning systems (http://www.digitalreasoning.com/) and principal at zaffra (http://zaffra.com), is a computer scientist who is passionate about data mining, open source, and web application technologies. he’s also the author of dojo: the definitive guide (o’reilly).
------------------------------------------------------------------------------------
chapter 1 introduction: hacking on twitter data
installing python development tools
collecting and manipulating twitter data
closing remarks
chapter 2 microformats: semantic markup and common sense collide
xfn and friends
exploring social connections with xfn
geocoordinates: a common thread for just about anything
slicing and dicing recipes (for the health of it)
collecting restaurant reviews
summary
chapter 3 mailboxes: oldies but goodies
mbox: the quick and dirty on unix mailboxes
mbox + couchdb = relaxed email analysis
threading together conversations
visualizing mail “events” with simile timeline
analyzing your own mail data
closing remarks
chapter 4 twitter: friends, followers, and setwise operations
restful and oauth-cladded apis
a lean, mean data-collecting machine
constructing friendship graphs
summary
chapter 5 twitter: the tweet, the whole tweet, and nothing but the tweet
pen : sword :: tweet : machine gun (?!?)
analyzing tweets (one entity at a time)
juxtaposing latent social networks (or #justinbieber versus #teaparty)
visualizing tons of tweets
closing remarks
chapter 6 linkedin: clustering your professional network for fun (and profit?)
motivation for clustering
clustering contacts by job title
fetching extended profile information
geographically clustering your network
closing remarks
chapter 7 google buzz: tf-idf, cosine similarity, and collocations
buzz = twitter + blogs (???)
data hacking with nltk
text mining fundamentals
finding similar documents
buzzing on bigrams
tapping into your gmail
before you go off and try to build a search engine…
closing remarks
chapter 8 blogs et al.: natural language processing (and beyond)
nlp: a pareto-like introduction
a typical nlp pipeline with nltk
sentence detection in blogs with nltk
summarizing documents
entity-centric analysis: a deeper understanding of the data
closing remarks
chapter 9 facebook: the all-in-one wonder
tapping into your social network data
visualizing facebook data
closing remarks
chapter 10 the semantic web: a cocktail discussion
an evolutionary revolution?
man cannot live on facts alone
hope
Author : Matthew a. russell
Publication : Oreilly
Isbn : 9789350232941
Store book number : 109
NRS 920.00
|
|
|
|
|
|
|
|
|
|